A hierarchical finite mixture model that accommodates zero-inflated counts, non-independence, and heterogeneity.

نویسندگان

  • Charity J Morgan
  • Mark F Lenzenweger
  • Donald B Rubin
  • Deborah L Levy
چکیده

A number of mixture modeling approaches assume both normality and independent observations. However, these two assumptions are at odds with the reality of many data sets, which are often characterized by an abundance of zero-valued or highly skewed observations as well as observations from biologically related (i.e., non-independent) subjects. We present here a finite mixture model with a zero-inflated Poisson regression component that may be applied to both types of data. This flexible approach allows the use of covariates to model both the Poisson mean and rate of zero inflation and can incorporate random effects to accommodate non-independent observations. We demonstrate the utility of this approach by applying these models to a candidate endophenotype for schizophrenia, but the same methods are applicable to other types of data characterized by zero inflation and non-independence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zero-inflated Poisson regression mixture model

Excess zeros and overdispersion are commonly encountered phenomena that limit the use of traditional Poisson regression models for modeling count data. The focus of this paper is on modeling count data in the case that a population has excess zero counts and also consists of several sub-populations in the non-zero counts. The proposed zero-inflated Poisson regression mixture model accounts for ...

متن کامل

Marginalized mixture models for count data from multiple source populations

Mixture distributions provide flexibility in modeling data collected from populations having unexplained heterogeneity. While interpretations of regression parameters from traditional finite mixture models are specific to unobserved subpopulations or latent classes, investigators are often interested in making inferences about the marginal mean of a count variable in the overall population. Rec...

متن کامل

Explaining Heterogeneity in Risk Preferences Using a Finite Mixture Model

This paper studies the effect of the space (distance) between lotteries' outcomes on risk-taking behavior and the shape of estimated utility and probability weighting functions. Previously investigated experimental data shows a significant space effect in the gain domain. As compared to low spaced lotteries, high spaced lotteries are associated with higher risk aversion for high probabilities o...

متن کامل

Assessment of length of stay in a general surgical unit using a zero-inflated generalized Poisson regression

Background: The effective use of limited health care resources is of prime importance. Assessing the length of stay (LOS) is especially important in organizing hospital services and health system. This study was conducted to identify predictors of LOS among patients who were admitted to a general surgical unit.    Methods: In this cross-sectional study, the sample included all patien...

متن کامل

Quantifying the impact of inter-site heterogeneity on the distribution of ChIP-seq data

Chromatin Immunoprecipitation followed by sequencing (ChIP-seq) is a valuable tool for epigenetic studies. Analysis of the data arising from ChIP-seq experiments often requires implicit or explicit statistical modeling of the read counts. The simple Poisson model is attractive, but does not provide a good fit to observed ChIP-seq data. Researchers therefore often either extend to a more general...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Statistics in medicine

دوره 33 13  شماره 

صفحات  -

تاریخ انتشار 2014